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Abstract. Given a set C of binary n-tuples and c £ C, how many bits 
of c suffice to distinguish it from the other elements in C ? We shed 
new light on this old combinatorial problem and improve on previously 
known bounds. 



1 Introduction 

Let C C {0, 1}" be a set of distinct binary vectors that we will call a code, and 
denote by [n] = {1, 2, ...n} the set of coordinate positions. It is standard in coding 
theory to ask for codes (or sets) C such that every codeword c e C is as different 
as possible from all the other codewords. The most usual interpretation of this 
is that every codeword c has a large Hamming distance to all other codewords, 
and the associated combinatorial question is to determine the maximum size 
of a code that has a given minimal Hamming distance d. The point of view of 
the present paper is to consider that "a codeword c is as different as possible 
from all the other codewords" means that there exists a small subset W C [n] of 
coordinates such that c differs from every other codeword in W . Put differently, 
it is possible to single out c from all the other codewords by focusing attention 
on a small subset of coordinates. More precisely, for x e {0, 1}", and W C [n] 
let us define the projection ttw 

t:w :{0,1}["1-.{0,1}^ 

X (-> {Xi)i^w 

and let us say that W is a witness set (or a witness for short) for c S C if 
'''w(c) 7^ TTwic') for every c' £ C, c' . Codes for which every codeword has a 
small witness set arise in a variety of contexts, in particular in machine learning 
theory [1, 3, 4] where a witness set is also called a specifying set or a discriminant: 
see [5, Ch. 12] for a short survey of known results and also [2] and references 
therein for a more recent discussion of this topic and some variations. 



Let us now say that a code has the tw-witness property, or is a w-witness 
code, if every one of its codewords has a witness set of size w. Our concern is to 
study the maximum possible cardinahty f{n,w) of a w-witness code of length 
n. We shall give improved upper and lower bounds on /(n, w) that almost meet. 

The paper is organised as follows. Section 2 gives some easy facts for reference. 
Section 3 is devoted to upper bounds on f{n, w) and introduces our main result, 
namely Theorem 2. Section 4 is devoted to constant weight w-witness codes, and 
we derive precise values of the cardinality of optimal codes. Section 5 studies 
mean values for the number of witness sets of a codeword and the number of 
codewords that have a given witness set. Section 6 is devoted to constructions 
of large w-witness codes, sometimes giving improved lower values of f{n,w). 
Finally, Section 7 concludes with some open problems. 

2 Easy and known facts 

Let us start by mentioning two self-evident facts 

— If C is a w-witncss code, so is any translate C + x, 

— f{n, w) is an increasing function of n and tu. 

Continue with the following example. Let C be the set of all n vectors of length 
n and weight 1. Then every codeword of C has a witness of size 1, namely its 
support. Note the dramatic change for the slightly different code C U {0}. Now 
the all-zero vector has no witness set of size less than n. Bondy [3] shows 
however that if \C\ < n, then C is a w-witness code with w < |C| — 1 and 
furthermore C is a uniform w-witness code, meaning that there exists a single 
subset of [n] of size w that is a witness set for all codewords. 

We clearly have the upper bound |C| < 2"' for uniform w-witness codes. For 
ordinary w-witness codes however, the best known upper bound is, [5, Proposi- 
tion 12.2], 

/(n,w)<2»Q. (1) 

The proof is simple and consists in applying the pigeon-hole principle. A subset 
of [n] can be a witness set for at most 2™ codewords and there are at most (^) 
witness sets. 

We also have the following lower bound on /(n, w), based on a trivial con- 
struction of a w-witness code. 

Proposition 1. We have: f{n,w) > (^). 

Proof. Let C = ('^') be the set of all vectors of weight w. Notice that for all 
c G C, W(c) = supportic) is a witness set of c. 

Note that the problem is essentially solved for w > n/2; since /(n, w) is 
increasing with w, we then have: 

2" > /(n,w) > f{n,n/2) > ( " ) > 2"/(2n)V2. 



We shall therefore focus in the sequel on the case w < n/2. 
In the next section we improve the upper bound (1) to a quantity that comes 
close to the lower bound of Proposition 1. 

3 An improved upper bound 

The key result is the following. 

Theorem 1. Let g{n,w) = /(n, Then, for fixed w, g{n,w) is a decreas- 

ing function of n. That is: 

n>v>w => g{n,w) < g{v,w). 

Proof. Let C be a binary code of length n having the w-witness property, with 
maximal cardinality |C| = f{n,w). Fix a choice function </) : C — > ('^') such 
that for any c G C, 4)(c) is a witness for c. For any V G (^"'), denote by Cy the 
subset of C formed by the c satisfying (p{c) C V. Remark that the projection Try 
is injective on Cy, since each element of Cy has a witness in V. Then 7ry(Cy) 
also has the w-witness property. 

Remark now that if V is uniformly distributed in (^"') and W is uniformly 
distributed in ('^^) and independent from V, then for any function : ('^1) — > M 
one has 

Ewii^iW)) = Ev{EwmW) I W C V)), (2) 

where we denote by Ew{'4'{W)) the mean value (or expectation) of ^piW) as W 
varies in and so on. 

We appty this with V(l^) = \(I>~'^{W)\ to find 

= Ew{\r\w)\) 

= Ev{Ew{\(l>-\W)\\W ^V)) 

= ^-((:)"'E„,pr'wi) 

=..((:)"",cv,) 

< g{v,w) 

the last inequality because 7ry(Cy) is a binary code of length v having the w- 
witness property. 



Remark: It would be interesting to try to improve Theorem 1 using some unex- 
ploited aspects of the above proof, such as the fact that the choice function ^ may 
be non-unique, or the fact that the last inequality not only holds in mean value, 
but for all V. For instance, suppose there is a codeword c G C (with C optimal as 
in the proof) that admits two distinct witnesses W and W , with W (f. W. Let 
^ be a choice function with ^(c) — W, and let <j)' be the choice function that co- 
incides everywhere with </>, except for <j)'{c) = W. Let V contain W but not W. 
If we denote by Cy the subcode obtained as Cy but using (j)' as choice function, 
then Cy = Cv U {c} (disjoint union), so |7ry(Cy)| = |7ry(Cy)| — 1 < f{v,w), 
and g{n,w) < g{v,w). 

Theorem 1 has a number of consequences: the following is straightforward. 
Corollary 1. For fixed w, the limit 

hm g{n,w) = 

n— >cxD I I 

\ui/ 

exists. 

The following theorem gives an improved upper bound on f{n,w). 
Theorem 2. For w < n/2, we have the upper hound: 

f{n,w) < 2^1/2 

Proof. Choose v = 2w and use f{v,w) < 2"; then f{n,w) < 0/(2w, w)/(^J) 
and the result follows by Stirling's approximation. 

Set w — um and denote by h{x) the binary entropy function 

h{x) = — a;log2 x — (1 — x) log2(l — x). 

Theorem 2 together with Proposition 1 yield: 

Corollary 2. We have 

lim„^oo ^ log2 /(n, wn) = hiuj) forO<u)< 1/2 

= 1 for 1/2 < a; < 1. 

4 Constant-weight codes 

Denote now hy f{n,w,k) the maximal size of a w- witness code with codewords 
of weight k. The following result is proved using a folklore method usually at- 
tributed to Bassalygo and Elias, valid when the required property is invariant 
under some group operation. 




Proposition 2. We have: 



max f{n,w,k) < f{n, w) < mm . 

[k) 



Proof. The lower bound is trivial. 

For the upper bound, fix k, pick an optimal u>- witness code C and consider its 
2" translates by all possible vectors. Every n-tuple, in particular those of weight 
k, occurs exactly |C| times in the union of the translates; hence there exists a 
translate (also an optimal w;-witness code of size f{n,w) - see the remark at 
the beginning of Section 2) containing at least the average number |C|(^)2~" of 
vectors of weight k. Since k was arbitrary, the result follows. 

Wc now deduce from the previous proposition the exact value of the function 
/(n, w, k) in some cases. 

Corollary 3. For constant-weight codes we have: 

— If k < w < n/2 then f{n, w, k) — (^) and an optimal code is given by Sk{0), 
the Hamming sphere of radius k centered on 0. 

— If n — k < w < n/2, then f{n,w,n — k) = (^) and an optimal code is given 
by the sphere 5^(1). 

Proof. If k < w < n/2, we have the following series of inequalities: 

Q <f{n,k,k)<f{n,w,k)<(^^. 
If n — k <w < n/2, perform wordwise complementation. 



5 Some mean values 

Let C be a binary code of length n (not necessarily having the w-witness prop- 
erty). Let 

Wc,w ■■ C 2(»'), Wc,^(c) = {W e (^^^ : W is& witness for c}, 

and symmetrically, 

Cc,w ■■ (^^^ 2<^, Cc,w{W) = {ceC : is a witness for c}. 

Remark that if C" C C is a subcode, then Wc',w{c) D VVc,u)(c) for any 
c e C, while Cc',u,{W) D {C n Cc,u,{W)) for any W € {^l^) . 



Lemma 1. With these notations, the mean values of \yVc,w\ o,nd \Cc,w\ ire 
related by 

\C\E,{\WcAc)\) = (^iyw{\CcAW)\), 

or equivalently 

\C\ _ Ewi\CcAW)\) 
O E,{\WcAc)\) ■ 

Proof. Double count the set c) e ('^1) x C : W is a witness for c|. 

Now let 7(C, w) = Ew{\Cc.w{W)\) and let 7^(n, w) be the maximum possible 
value of 7(C, w) for C a binary code of length n, and 7+"'"(n, w) be the maximum 
possible value of 7(C, w) for C a binary code of length n having the t«-witness 
property. 

Lemma 2. With these notations, one has 7+(n, w) = 7++(n, w). 

Proof. By construction 7+(n,7i;) > 7++(n, w). On the other hand, let C be 
a binary code of length n with 7(C, w) = 7+(n,w), and let then C" bo the 
subcode of C formed by the c having at least one witness of size w, i.e. C = 
UvyeC"!) ^c,w{W). Then C has the w-witness property, and 

'y~^~^{n,w) > 7((7',w) > ^{C,w) = 7"'"(n, w). 

The technique of the proof of Proposition 1 immediately adapts to give: 

Proposition 3. With these notations, w being fixed, ^^{n,w) is a decreasing 
function of n. That is: 

n> V > w => 'y~^{n,w) < j~^{v, w). 

Proof. Let C be a binary code of length n with ■y{C,w) = 7"'"(n, w). For V £ 
denote by Cy the subset of C formed by the c having at least one witness of 
size w included in V, i.e. Cy = IJvkg(^) ^C,w{W). Then C'y has the w-witness 
property, Cc,w{W) C Cc[^,w{W) for any W C V, and wv is injective on Cy. 
Using this and (2), one gets: 

^+{n,w) = Ew{\CcAW)\) 

= EyiEw{\CcAW)\\W CV)) 

< Ev{Ew{ \Cc'^AW)\ \W^V)) 

= Ev{Ew{ \C^,ic^)AW)\ \WCV)) 
= Ev{^{'jrv{C^),w)) 

< 'y~^{v, w). 



6 Constructions 



6.1 A generic construction 

Let T C (<^) be a set of subsets of {1, ... , n} all having cardinality at most w. 

Let Cj: C {0, 1}" be the set of words having support included in one and 
only one W ^ T . Then: 

Proposition 4. With these notations, Cjr has the w-witness property. 

Proof. For each c e C^, let Wc be the unique W G containing the support of 
c. Then Wc is a witness for c. 

Example 1. For jC" = (N) we find Cjp = S'^(O), and 

Example 1'. Suppose w > n/2. Then for JT = we find Cjp = S'„/2(0), 
and 

(where for ease of notation we write n/2 instead of [n/2\). 

Example 2. For = {1^} with \W\ <wwe find Cjf = {0, 1}^ (where we 
see {0, 1}^ as a subset of {0, 1}" by extension by on the other coordinates), 
and 

f{n,w)>\C^\ = 2^. 

Exemple 3. Let !F be the set of (supports of) words of a code with constant 

weight w and minimal distance d (one can suppose d even). Then for all distinct 
W,W' G one has \ W nW'l < w - d/2, so for all W GJ^, the code Cj, contains 
all words of weight larger than w — d/2 supported in W. This implies : 

Corollary 4. For all d one has 

f{n,w) > A{n,d,w)B{w,d/2 - 1) 

where: 

- A{n, d, w) is the maximal cardinality of a code of length n with minimal 
distance at least d and constant weight w 

- B{w,r) = ^i<i<r(Y) cardinality of the hall of radius r in {0, l}*". 

For d = 2, this construction gives the sphere again. For d = 4, this gives 
f{n, w) > {1 + w)A{n, d, w). We consider the following special values: 

- n = 4, d = 4, w = 2: A(4, 4, 2) = 2 

- n = 8,d = 4,w = 4: A(8, 4, 4) = 14 

- n= 12, rf = 4, «; = 6: A(12, 4, 6) = 132 



the last two being obtained with T the Steiner system 5(3, 4, 8) and ^(S, 6, 12) 
respectively. 

The corresponding codes C^f have same cardinality as the sphere (2x3 = 6, 
14 X 5 = 70 and 132 x 7 = 924 respectively), but they are not translates of 
a sphere. Indeed, when C is a (translate of a) sphere with w = n/2, one has 
Cc,tu(VF) = 2 for any window W G ('^1). On the other hand, for C = as 
above, one has by construction Cc,w (W^) = w + 1 for G .F. 



6.2 Another construction 

Let D C {0, 1}™ be a binary (non-linear) code of length w > n/2 and minimal 
weight at least 2w — n. 

Let Ci be the code of length n obtained by taking all words of length w that 
do not belong to D, and completing them with on the last n — w coordinates. 
Thus \Ci\ = 2-^ - \D\. 

Let C2 be the code of length n formed by the words c of weight exactly w, 
and such that the projection of c on the first w coordinates belongs to D. Thus 
if fife is the number of codewords of weight k in D, one finds IC2I = '^fe(^-fc)- 

Now let C be the (disjoint!) union of Ci and C2. Then C has the witness 
property. Indeed, let c G C. Then if c G Ci, c admits [w] as witness, while if 
c G C2, c admits its support as witness. 

As an illustration, let D be the sphere of radius w — t in {0, l}"", for t G 
{l,...,2i^}. Then 

/(..»).ici.2..(;:^)((»7')-i 

If w satisfies 2™ > (^"2) w <n—l, this improves on examples 1,1', and 
2 of the last subsection, in that one finds then 

/(n,H>|C|>max(Q,(^^^),2»). 

On the other hand, remark that Ci C {0, 1}["'1 and C2 C 5^(0), so that 
|C|<2- + (;). 



7 Conclusion and open problems 

We have determined the asymptotic size of optimal w- witness codes. A few issues 
remain open in the non-asymptotic case, among which: 

— When is the sphere 5*^(0) the/an optimal w-witness code? Do we have 
f{n,w) = (") for w < n/2 ? In particular do we have f{2w,w) = (^"') ? 

— For w > n/2, do we have /(n, w) < max( ,2"" + (l)) 7 

— Denoting by f{n, w, > d) the maximal size of a w;- witness code with minimum 
distance d, can the asymptotics of Proposition 2 be improved to 

— log2 f{n,u)n, > Sn) < h{w) ? 
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